Supporting fault-tolerant distributed computations under real-time requirements
نویسندگان
چکیده
In contrast to conventional (trans)action concepts the proposed dynamic action model includes the possibility for optimistic recovery in order to gain high efficiency during normal operation. To minimize time overhead we use a redundant recovery graph to record the necessary recovery information. Based on this graph we provide decentral ized protocols that efficiently produce a consistent system state concurrent to normal system activity. Considering real-time applications in distributed systems, error processing time has to be minimized. To achieve this, the proposed concept is extended to the parallel dynamic action scheme where the different versions are executed in parallel. This leads to a recovery concept that combines efficient distributed processing during normal operation and prompt reaction in case of an error .
منابع مشابه
Distributed and fault-tolerant thread management for parallel computations
Application of distributed computations becomes more attractive with the advances in computer networks. Lemick is an environment for automation of real-time distributed programming expressed in the terms of more traditional multithreaded programming. This paper gives a brief introduction into the implementation of Lemick distributed threads. A focus is made on the mechanisms employed to support...
متن کاملFault-Tolerant Static Scheduling for Real-Time Distributed Embedded Systems
We present in this paper a heuristic for producing automatically a distributed fault-tolerant schedule of a given data-flow algorithm onto a given distributed architecture. The faults considered are processor failures, with a failsilent behavior. Fault-tolerance is achieved with the software redundancy of computations and the time redundancy of data-dependencies.
متن کاملFault Tolerance in a Multi-Layered DRE System: A Case Study
Dynamic resource management is a crucial part of the infrastructure for emerging distributed real-time embedded systems, responsible for keeping mission-critical applications operating and allocating the resources necessary for them to meet their requirements. Because of this, the resource manager must be fault-tolerant, with nearly continuous operation. This paper describes our efforts to deve...
متن کاملModel-based development of fault-tolerant real-time systems
The design of fault-tolerant real-time systems is a complex task. The system must not only satisfy real-time requirements, but it must also deliver the specified functionality in the presence of both hardware and software faults. To achieve fault-tolerance, the system has to use redundancy. This redundancy is usually achieved by replicating hardware units and executing the application within a ...
متن کاملImproving Performance in Adaptive Fault Tolerance Structure with investigating the effect of the number of replication
Regarding the wide use of distributed systems in various areas, having a system with fault tolerance ability would be an import characteristic. And in designing the real time distributed systems, this seems to be more considerable. With regard using some middleware like CORBA in designing such systems, and in order to increase their compatibility, speed, performance, to simplify the network pro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computer Communications
دوره 15 شماره
صفحات -
تاریخ انتشار 1992